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(54) Method for image compression and device for implementing the method 



(57) The invention relates to a method of image 
compression, especially of the MPEG2 type, in which 
the images are coded according to groups (GOPs) each 
of which comprises N images, with an I image coded in 
intra mode, P images predicted as a function of the intra 
image I or of the preceding P image, each P image being 
preceded and followed by n bidirectionally predicted B 
images, n possibly being zero. The number M = n + 1 



represents the structure of the group. 

At least one parameter characterizing the source 
images which are to be coded according to a group is 
determined using a test coding (70), and the numbers 
N and M are made to depend on this parameter or these 
parameters. In the course of the test coding, defined val- 
ues are conferred on N, M, and on the quantisation in- 
terval Q. 
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Description 

[0001] The invention relates to a method of image compression in which the images are coded according to groups 
of variable lengths. 

[0002] It relates more particularly to a method of the MPEG type, particularly of MPEG2 type. Although the invention 
is not limited to this standard, it will be referred to primarily in the remainder of the description. 
[0003] The principle of such compression is reiterated below. 

[0004] In the video MPEG2 standard, compression of the digital video signals is obtained by exploiting the spatial 
redundancy and the temporal redundancy of the coded images. 

[0005] The spatial redundancy is evaluated principally by virtue of a succession of three operations: an operation 
commonly called discrete cosine transform and denoted DCT ("Discrete Cosine Transform"), an operation of quanti- 
sation of the coefficients arising from the DCT and an operation of variable-length coding to describe the quantified 
coefficients arising from the DCT 

[0006] The temporal redundancy is analysed by a movement-compensation operation which consists, by translation 
of each block of the current image, in searching for the most similar block situated in the reference image. The analysis 
of the temporal redundancy leads to a field of translation vectors being determined, commonly called movement vectors, 
as well as a prediction error which is the difference between the signal of the current image and the signal of the image 
predicted by movement compensation. The prediction error is then analysed according to the principle of spatial re- 
dundancy. 

[0007] MPEG coding is of predictive type. It follows that the decoding which is associated with it should be regularly 
reinitialised so as to protect the signal against any transmission error or any break in signal due to the decoder being 
switched over from one programme to another. 

[0008] To this end, the MPEG2 standard provides that, periodically, the images should be coded in spatial mode, 
that is to say according to a mode exploiting only spatial redundancy. The images coded in spatial mode are called 

2 5 INTRA images or I images. 

[0009] The images coded by exploiting temporal redundancy are of two types: on the one hand, the images con- 
structed by reference to a temporally previous image on the basis of a front prediction and, on the other hand, the 
images constructed by reference to two temporally previous and subsequent images on the basis of a front prediction 
and of a back prediction. I 

30 [001 0] The coded images constructed on the basis of a front prediction are called predicted images or P images and 
the coded images constructed on the basis of a front and of a back prediction are called bidirectional images or B 
images. 

[0011] An I image is decoded without reference beiqg made to images other than itself. A P image is decoded by 
referring to the P or I image which precedes it. A B image is decoded by relying on the I or B image which precedes it 
35 and on the I or P image which follows it. 

[0012] The periodicity of the I images defines a group of images widely denoted GOP ("Group Of Pictures"). 
[001 3] Within a single GOP, the quantity of data contained in an I image is generally greater than the quantity of data 
contained in a P image and the quantity of data contained in the P image is generally greater than the quantity of data 
contained in a P image. 

40 [0014] At 50 Hertz, the GOP is presented as an I image followed by a sequence of B and P images which, most of 

the time, exhibits the following sequence 
I, B, B, P, B, B, P, B, B, P, B, B. 

[001 5] However, the standard does not demand N = 12 images being provided in a GOR as is the general case, nor 

that the distances M between two P images should always be equal to 3. More precisely, the distance M is the number 
45 n of B images preceding or following a P image : increased by one unit, i.e. M = N +1. 

[0016] The number N represents the size or length of the GOP, while the number M represents its structure. 

[0017] The invention results from the observation that it is possible to act on the M and N parameters to enhance 

the level of compression and/or enhance the quality of the coding. 

[0018] The method of coding according to the invention is characterised in that at least one parameter is determined 
so characterising the source images which are to be coded according to a group and in that the length and the structure 
of the group is made to depend on this parameter or these parameters. 

[0019] In one embodiment, the parameter(s) characterising the source images is or are determined with the aid of 
a test coding in the course of which defined values are allocated to N, M and to the quantisation interval Q. 
[0020] The test coding is carried out, for example, in open loop. 
ss [0021] In one particularly simple embodiment, a parameter (Pcost) characterising the P images obtained during the 
test coding and a parameter (Bcost) characterising the B images obtained during the test coding are determined sep- 
arately, these parameters characterising the P and B images being, preferably, the average costs of coding of the P 
and B images. The cost of coding an image is the number of bits (headers included) which is necessary for the coding. 
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[0022] In this case, the number N can be made to depend on the parameter characterising the P images and the 
number M on the parameter characterising the B images. 

[0023] During trials carried out in the context of the invention, on sequences of images of various types, it was noted 
that, for each type of sequence, an optimal number N existed providing a minimum coding cost (or throughput) for the 
5 P images and an optimal number M providing a minimum coding cost (or throughput) for the B images, these costs 
being obtained during the test coding. These sequences are distinguished from another by movement of variable 
amplitudes, different objects, different spatial definitions and different contents. 

[0024] It was noted, moreover, that a practically linear relationship exists between the optimal number N and the 
throughput of the P images. Likewise, a practically linear relationship exists between the number M and the throughput 
10 of the B images. Hence, knowing the throughputs of the P and B images, it is easy to calculate the numbers N and M 
providing the best results. 

[0025] In an example corresponding to the MPEG2 standard, 50 Hz, the test coding is carried out with N = 12, M = 
3 and Q = 15, the relationship between N and tha throughput of the P images is approximately as follows: 

(1) N = INT +1, with 12 < N < 30 

L K>000 J 

and the relationship between M and the throughput, or cost, Bcost of the B images is as follows: 

20 

^.-r-ri 7 90G0 ~ Bcostl . 
(2) N = INT +1, with 1 < M < 7. 

L 20000 J 

2B [0026] It is also possible to limit M to 5. 

[0027] In these formulae, INT signifies the integer part. 

[0028] The limitation on N between 12 and 30 and the limitation on M to a maximum value of 7 makes it possible to 
have a simple embodiment of the coders and to limit the programme-changing time. With the same aim, it is also 
possible to impose other limitations or constraints, particularly that M be constant in the GOP and/or that it be a sub- 
30 multiple of N. 

[0029] In one embodiment, if the values of N and of M taken individually and together are not compatible with the 
constraints, the values of M and of N closest to the calculated values and which satisfy the stipulated compatibility will 
be chosen. In this case, the value of M will be favoured, that is to say that if a choice has to be made between several 
M, N pairs, the pair will be chosen for which the M value is closest to that which results from the calculation. 
35 [0030] The formula (2) above applies only if Bcost does not exceed 179800. In the opposite case, that is to say if 
Bcost > 1 79000, experiment has shown that it was necessary, in this example, for M to be chosen in the following way: 



40 



(3) M = S.INT P 003 t - ll , with 1 < M < 7. 

Lb cos t J 



[0031] If the cost of a B image is higher than the cost of a P image, it is preferable for the GOP to contain no B image, 
that is to say M = 1 . This is because the P images, exhibiting a better prediction quality than the B images and being, 
by assumption, of lower cost, the presence of such B images would constitute a drawback in this case. 

45 [0032] The costs, in bits, of each P image and of each B image are determined, for example, as and when these 
images appear. In one embodiment, the values of M and N are selected by taking an average over all the P and B 
images of the test coding, the coding proper being carried out only after the test coding of N source images, N being 
determined by the cost of coding the P images. In this case, the parameter M may remain constant in the GOR 
[0033] In another embodiment, which allows a more rapid adaptation to the variations in content of the scenes as 

so well as a reduction in the delay between the arrival of the source images and coding proper (and which thus allows a 
lower-capacity buffer memory), the coding proper is started as soon as the test coding supplies data allowing this 
starting. Hence, the first B image of the test coding provides an M number allowing the coding to be started and the N 
number is supplied by the first P image of the test coding. It is also possible to have the coding start only after the test 
coding of the first P image; in this case, the coding starts when a value of N and a value of M are known. 

55 [0034] With this type of coding "on the fly 0 , the number M r that is to say the structure, may vary within a GOR which 
allows a more rapid adaptation to the variations in content of the scene. 

[0035] In the coding carried out progressively, the GOP is interrupted when the number of images already coded in 
the current GOP is at least equal to the measured number N (measured by Pcost in the above example), or upon a 
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change of scene. 

[0036] In order to avoid significant variations in the parameters between groups which follow one another, it may 
prove to be worthwhile to depart from the calculated values. For example, if the calculation shows that, for a large part 
of the length of the GOP, for example at least 80%, M = 1 would be necessary, whereas, for the rest of the GOP the 
5 calculation shows that M should be greater than 1 , the value 1 will be adopted for M, despite everything, even if the 
calculation shows that a different value is necessary. 

[0037] Likewise, if for the preceding GOP, M = 1 and if, for the current GOP, the calculation shows that a value M = 
1 would be necessary for a significant part of the cu rrent GOR for example at least 60%, the value 1 will also be adopted 
for M, even if the result of the calculation, as it results from the formula (2) above, implies a different value. 
io [0038] It is known that when a change of scene occurs, that is to say when a discontinuity appears in the sequence 
of video images, it is necessary to adapt the GOP image groups on either side of the discontinuity so that the new 
group, which starts with an I image, corresponds to the new scene. 

[0039] In one embodiment, if the change of scene occurs in a group, the new scene constitutes the I image of a new 
group, the affected group being shortened so as to stop before this new scene if the change of scene occurs in the 

15 affected group, at a distance from the start at least equal to the minimum number allowable for N. The start of the 
affected group is used to lengthen the group which precedes it when the sum of the number of images preceding the 
change of scene in the affected group and of the number of images of the group which precedes it does not exceed 
the maximum admissible for N. In this preceding group thus modified (shortened or lengthened), it may be necessary 
to modify the number M previously calculated for this GOP. 

20" [0040] In one variant, which is used for preference in the case in which the length of the affected group is less than 
the minimum allowable for N, when a change of scene occurs in a group, the new scene constitutes the I image of a 
new group, this new group having a length equal to the average of the length of the group before it was affected and 
the length of the group which precedes it. With this variant, it may be necessary to modify the number M previously 
calibrated for the GOPs. 

25 [0041] When two modifications are possible, for example when the length of the affected group is less than the 
minimum allowable for N, a choice may be made between these two modifications by carrying out a calculation, for 
each modification, of the distance of the (M, N) pair obtained or M, N pair before modification and by selecting the pair 
for which the distance is the smallest. 

[0042] In order to determine the parameters N and M, recourse may be had to the measurement of parameters other 
30 than the measurement of the throughputs For example, used may be made, in order to determine N, of the energy of 
the I Intra images. It is also possible to determine the amplitude of the movements or the movement compensation 
error, known as DFD (Displaced Frame Difference) for determining M and N. 

[0043] Other characteristics and advantages of the invention will emerge with the description of some of its embod- 
iments, this description being given by referring to the attached drawings, in which: 

35 

Figure 1 illustrates a macroblock to the 4.2.0 standard, 
Figure 2 is a diagram illustrating the DCT transform, 

Figure 3 shows a group of images, GOP, according to the MPEG standard or similar standard, 
Figures 4 to 7 are diagrams illustrating the method in accordance with the invention, and 
40 Figure 8 is a diagram of a layout for implementing the method according to the invention. 

[0044] Reference will be made firstly to Figures 1 to 3 which aim to reiterate certain principles used in MPEG2 coding. 
[0045] In the MPEG2 standard, a start point may be an image including, in progressive mode, 576 lines each of 720 
points. In interlaced mode, this image consists of two frames each of which comprises 288 lines, each also of 720 points. 

45 [0046] Each image is broken down into macroblocks each of which is formed by a square of 1 6X 1 6 luminance points. 
Each macroblock it thus formed of four square blocks of 8x8 luminance points. With each of these four luminance 
blocks, two chrominance blocks, each of which exhibits 8x8 points, are associated (in the 4.2.0 format), one of the 
blocks representing the colour difference signals Cr or red chrominance and the other block representing the colour 
difference signals Cb or blue chrominance. In the 4.2.2 format, with each luminance macroblock are associated four 

so 8X8 chrominance blocks, two blocks for the blue chrominance and two blocks for the red chrominance. There also 
exists a 4.4.4 format for which each of the luminance and chrominance components each includes four 8x8 blocks. 
[0047] Four 8x8 luminance blocks have been represented in Figure 1, referred to overall as 10, and 8X8 chromi- 
nance blocks 12 and 14 for the respectively blue and red chrominances, the whole illustrating a macroblock to the 
4.2.0 standard. 

ss [0048] Each block is coded by using a transform denoted DCT which is a discrete cosine transform which makes it 
possible to transform a luminance block (for example) into a block of coefficients representing spatial frequencies. As 
can be seen in Figure 2, the source block 16 is converted into a block 1 8 of 8x8 coefficients. The upper left corner 20 
of the block 18 corresponds to the zero spatial frequencies (average value of the block) and, from this origin 20, the 
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10 either intra or inter. In the case of an inter cod.ng over a P -tyP i J^^^^^ tM preceding I or P image, 
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[0052] The third type o, i-ges* called Bar ^^^^^S^SL difference between the 
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of M (Mopt). on tp<5t cpnuences which are conventional in MPEG coding 
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[OT62] When the values of M. N and Q used in the course of the test coding are as follows: 
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M= 12, 
N = 3, and 
Q = 15, 

the values of M and N satisfy the following relationships: 

(1) N = INTp 89000 - PcoSt l +1. with 12 < N < 30. 

L ioooo J 

(2) N = |NT r i79000 - Bcost l + 1 < M < 7. 
L 20000 J 

' .' . Sy : 

15 [0063] Although, for formula (2) above, it has been indicated that M should lie between 1 and 7, it is seen on the 
diagram of Figure 7 that in fact M may be limited to 5. 

[0064] The layout intended for implementing the invention is represented in Figure 8. It comprises a first MPEG2 
coder 70 intended to carry out the test coding or coding "first pass". This test coding is set up with the fixed parameters 
indicated above, namely, in this example: M = 12, N = 3 and Q= 15. This test coder operates, in this example, in open 
20 loop, that is to say without regulation. 

[0065] The coder 70 supplies the values Bcost and Pcost which are applied to a converter 72 which carries out the 
conversions of Pcost into Nopt and Bcost into Mopt, as represented in Figure 6 and 7 and in accordance with the 
relationships (1) and (2) above. 

[0066] These values N and M are calculated for a group of images, as described above, and are then applied to a 
25 control input 76 of an MPEG2 coder 74. 

[0067] The data at the input of the coder 74 are the same as those of the input of the test coder 70. Hence a buffer 
memory 78 is provided to take account of the processing time in the test coder 70 and in the converter 72, this memory 
78 holding the data during the processing. 

[0068] In the converter 72, it is also checked that the N, M pairs resulting from the formulae (1 ) and (2) are compatible 
30 with the constraints imposed in the embodiment, in particular that M is a sub-multiple of N. If the values resulting from 
the calculation are not compatible, values of N and of M are adopted which are closest to those which were calculated, 
favouring M values, however. 

[0069] The converter 72 also takes account of supplementary conditions. 

[0070] In the first place, it performs a comparison between Bcost and Pcost and, if Bcost is higher than Pcost, the 
35 value 1 is allocated to the number M, the GOP containing no B image. This is because, with this assumption, the B 
images entail a coding cost which is higher than the P images; it is preferable to keep only P images which exhibit a 
higher prediction quality. 

[0071] In the second place, the converter compares Bcost with the value 179000 and, if Bcost exceeds 179000, 
relationship (2) above is replaced by the heuristic relationship below: 



40 



45 



(3) M = 5.1NT 



i£^-ll,with1<M<7. 

_B cos t J 



[0072] The converter 72 also makes it possible to take account of two special cases for which it is necessary to 
depart from relationship (2) in order to obtain uniformity of image quality. 

[0073] The first case is as follows: the lest coding shows that M should exhibit a value at least equal to 2, but, 
moreover, this test coding also shows that the intermediate values obtained by M are equal to 1 over a large part of 

50 the group, for example at least 80%. In this case, the converter 72 stipulates that M be equal to 1. 

[0074] The second case is similar to the first one: the test coding shows that M should be at least equal to 2, but the 
intermediate values obtained for M are equal to 1 for least a part of the length of the group, for example 60% (this limit 
is below the limit anticipated in the first case), and the preceding group is such that M = 1 . In this case, the value 1 is 
also conferred on the number M. 

55 [0075] These two special cases, for which the value 1 is set for M, result from tests carried out in the context of the 
invention which have shown that these conditions allow a good uniformity of quality, for the same type of sequence, 
over successive groups. 

[0076] Finally, the converter 72 takes account of the changes of scene or "cuts" which are usually detected in the 
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(Pcost) characterizing the P images obtained during the test coding and a parameter (Bcost) characterizing the B 
images obtained during the test coding are determined. 

5. Method according to Claim 4, characterized in that the number N is determined on the basis of the parameter 
s characterizing at least one P image, and in that the number M is determined on the basis of the parameter char- 
acterizing at least one B image. 

6. Method according to Claim 4 or 5, characterized in that the parameters (Pcost, Bcost) characterizing the P and B 
images are the costs of coding the P and B images, for example the average costs. 

10 

7. Method according to Claims 5 and 6, characterized in that, when the average cost (Bcost) of coding each B image 
is, in the course of the test coding, higher than the average cost (Pcost) of coding each P image, the value 1 is 
conferred on the number M, the group thus containing no B image. 

is 8. Method according to Claim 6 or 7, characterized in that, in the course of the test coding, the cost of coding each 
B image and the corresponding number M is determined in step with the arrival ot the source images. 

9. Method according to Claim 8, characterized in that, when the M numbers determined before the end of the test 
coding are equal to 1 for a significant fraction of the group, the value 1 is conferred on the number M for the group. 

20 

10. Method according to Claim 8 or 9, characterized in that, when the M numbers determined before the end of the 
test coding are equal to 1 for at least a defined fraction of the group, and when the number M is equal to 1 for the 
preceding group, the value 1 is conferred on the number M for the group. 

25 11. Method according to any one of the preceding claims, characterized in that, in the event of a change of scene 
occurring in a group, the new scene constitutes the I image of a new group, the affected group being shortened 
so as to stop before this new scene if the change ol scene occurs in the affected group, at a distance from the 
start at least equal to the minimum number allowable for N, the start of the affected group being used to lengthen 
the group which precedes it when the sum of the number of images preceding the change of scene in the affected 

30 group, and of the number of images of the group which precedes it does not exceed the maximum admissible for N 

12. Method according to any one of Claims 1 to 9, characterized in that, in the event of a change of scene occurring 
in a group, the new scene constitutes the I image of a new group, the affected group and group which precedes 
it being rearranged so that each exhibits a length close to the average of the length of the group after alteration 

35 and of the group which precedes it. 

13. Method according to Claim 6, characterized in that with the test coding being carried out according to an MPEG- 
type standard at 50 Hertz, with N = 12, M = 3 and Q = 15, the numbers N and M are a function respectively of the 
average costs of coding B and P images according to the following relationships: 

40 

(1) N = , NT r3890Q0 - Pcostl 12 < N < 30 

v ' L 10000 J 

45 

179000 - Bcost 1 +1 , with 1 < M < 7, 

20000 J 

50 

INT signifying integer part. 

14. Method according to Claim 13, characterized in that 1 < M < 7. 

55 1 5. Method according to Claim 1 3 or 1 4, characterized in that, when the cost of coding (Bcost) is greater than 1 79000, 
the number M is determined by the following relationship: 



(2) N = INT 
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